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Abstract. The dynamics of minority games with agents trading on different time scales is studied via dy- 
namical mean-field theory. We analyze the case where the agents' decision-making process is deterministic 
and its stochastic generalization with finite heterogeneous learning rates. In each case, we characterize the 
macroscopic properties of the steady states resulting from different frequency and learning rate distribu- 
tions and calculate the corresponding phase diagrams. Finally, the different roles played by regular and 
occasional traders, as well as their impact on the system's global efficiency, are discussed. 
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1 Introduction 

The collective phenomena that characterize the evolution 
of competitive populations of adaptive agents, such as the 
onset of cooperation or the creation of exploitable infor- 
mation, have attracted a great deal of attention from sta- 
tistical physicists over the past few years. The hope is that 
the occurrence of such macroscopic effects can be under- 
stood starting from the laws that govern the behavior of 
the individual agents. Thanks to the basic simplicity of 
its definition, the minority game (MG) [1,2] allows a full 
theoretical analysis of many of these issues. Originally, it 
was designed to mimic a market of speculators subject to 
the law of supply and demand. At each time step, traders 
react to the receipt of a public information pattern (the 
'state of the world') by either buying or selling, trying 
to profit from price fluctuations to maximize their payoff, 
and learning from experience. Upon decreasing the rel- 
ative number a of possible information patterns, the de- 
gree of cooperation in the system, measured by the inverse 
magnitude of global fluctuations, increases, until a criti- 
cal point a c is reached, below which highly cooperative as 
well as highly uncooperative states can occur, depending 
on the initial conditions of the agents' learning processes. 
Remarkably, the low a phase is characterized by the ab- 
sence of exploitable information, while in the high a phase 
the market is to some degree predictable and hence inef- 
ficient. 

On the technical level, the MG is defined by a set of 
globally coupled zero-temperature Markov processes with 
quenched disorder and without detailed balance. The sys- 
tem at a c undergoes a dynamical phase transition with 
ergodicity breaking, related to the onset of anomalous re- 
sponse. The stationary state is solvable, at least in the 
ergodic phase, via both static (replica) and dynamical 



methods, so that a very deep understanding of its macro- 
scopic properties is achievable. Moreover, the MG is open 
to a large class of economically- and biologically-inspired 
extensions, that allow to tackle such key issues as the in- 
terplay of different types of traders in a market [2]. 

An interesting modification of the original model is ob- 
tained when synchronicity is removed, and one accounts 
for the possibility that agents trade on different time scales. 
This is the 'colored' minority game (CMG) first intro- 
duced in [3] . Assuming a power-law distribution of trading 
frequencies and using the static approach of [4,5], it was 
shown that, while the phase transition picture is substan- 
tially preserved, the actual critical point a c depends on 
the particular exponent entering the power law. Further- 
more, regular and occasional traders were shown to take 
markedly different behaviors, with the latter less prone 
to change trading strategy and hence contributing signif- 
icantly less to the growth of global fluctuations. 

In this work, we adapt the generating functional meth- 
ods originally devised for spin glasses [6-10] and already 
employed in the theory of MGs [11-15] to study the dy- 
namics of the CMG in a few different cases. First, as- 
suming that the agents' learning process is deterministic, 
we derive a stochastic equation describing the behavior 
of an effective agent trading with frequency / (Sec. 2). 
Global dynamical quantities such as correlation and re- 
sponse functions appear as proper averages over the trad- 
ing frequency distribution q(f). We calculate the phase 
diagrams exactly and evaluate macroscopic order param- 
eters in ergodic stationary states explicitly in the simple 
case where q(f) is bimodal (i.e. for a mixed population of 
frequent and occasional traders, with a wide separation of 
time scales between the former and the latter, Sec. 3) and 
when it has a power-law form (i.e. when there is no sin- 
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gle characteristic time scale for trading, Sec. 4) . The roles 
of regulars and irregulars can be thoroughly investigated. 
Then (Sec. 5), we move to the case where the agents' learn- 
ing process is stochastic by introducing different learning 
rates as a further element of heterogeneity among agents 
(besides their strategies and trading frequencies). We an- 
alyze additive and multiplicative decision noise (Sec. 6), 
deriving the corresponding phase structure as a function 
of the learning rates. Finally, we summarize our results 
and formulate our conclusions (Sec. 7). In order to keep 
the treatment concise, we will not report the full gener- 
ating functional analysis and limit ourselves to a study of 
the resulting effective mean-field equations. The reader is 
from this moment referred to [11, 13] for all details about 
its derivation in similar instances. 



2 CMG with deterministic decision-making 

The basic ingredients of the MG are as follows. There are 
N agents. At each round n of the game all agents receive 
the same information pattern /Lt(n) drawn at random with 
uniform probability from a set of P possible. P is assumed 
to scale linearly with N, so that in the limit N — > oo the 
parameter a = P/N (the relative number of information 
patterns) remains finite. We assume that each agent is en- 
dowed with two different strategies (labeled by g = 1,2) 
to convert the information into a ternary trading decision: 
a ig : {1, . . . , P} 3 n — > a? g € {-1,0, 1}. a? g is the action 
prescribed to agent i by his 5-th strategy upon receipt of 
information \i. One might think that a? = ±1 correspond 
to 'buy' and 'sell', respectively, while a^ g = stands for 
'do nothing'. For all i, g and fj,, each af g is selected ran- 
domly and independently from {—1,0, 1} before the start 
of the game and fixed. Following [3], we take a probability 
distribution of the form 

P K) = \ + f^,-l + (1 - /0<*af,,0 (1) 

The numbers /, (0 < /j < 1) represent the trading fre- 
quencies of agents. They can be seen as an additional fam- 
ily of i.i.d. quenched random variables with a prescribed 
probability density which we denote by q(f). In the orig- 
inal MG, where all agents trade at all rounds, one has 
q(f) = S(f-l). 

The system's dynamics is defined through the micro- 
scopic stochastic equations that govern the decision mak- 
ing of the individual traders. Each strategy of every agent 
is given an initial valuation p ig (0) , which is updated at the 
end of every round. Loosely speaking, p ig (n) measures the 
performance of g up to round n. At the beginning of round 
n, every trader selects his so-far best-performing strategy, 
gi(ri) = argmaxpi g (n), and formulates a bid according 
to the trading decision it prescribes: bAn) = a?~ n ) x . The 
total bid (or, the excess demand) at round n is simply 
A(n) = (1/y/N) J2iLi Once A(n) is known, strat- 

egy valuations are updated according to 

Pig (n + l)= Pig (n)-a^ n) A(n) (2) 



and agents move to the next round. Strategies that would 
have prescribed the minority (resp. majority) action, i.e. 

such that af g n) A(n) < (resp. > 0), are thus rewarded 

(resp. penalized). The valuation of strategies with afj™' = 
is instead left unchanged. We assume that agents neglect 
their market impact [5]. 

Introducing the 'preferences' yi(n) = \pn(n)— Pi2(n)]/2, 
and the quantities = (an — aj2)/2, 0Ji = (an + aj2)/2, 
and fl = (1/VN) J2?=i w», (2) becomes 

1 N 

yi (n+l)= Vl (n) C (n) [^ (n) + -f=52 af n) sj (n)] (3) 

j=i 

The Ising spin Si(n) = sign[?/j(n)] G {—1, 1} is the relevant 
dynamical variable: Sj(n) = 1 (resp. —1) indicates that 
agent i has chosen strategy g = 1 (resp. 2) at round n. 

The simplest dynamical approach to the analysis of the 
stationary states of (3) is based on the 'batch' approxima- 
tion, first employed in [11]. It consists in averaging (3) 
over fj,. The result is 

N 

y i (t + l) = y i (t)-h i -J2 j *j s i (*) ( 4 ) 

where t is a rescaled time, hi = (2/\fN) fl ■ £ i and = 
(2/N) £j • This choice describes strictly speaking a 
situation in which valuation updates are made once every 
P steps or, equivalently, considering the average effect of 
all possible /x's. It is now well known, that the stationary 
state of (4) is qualitatively and quantitatively very similar, 
though not identical, to that of (3). The analysis of (4) is 
however much simpler and more straightforward than that 
of the original 'on-line' dynamics (3). 

In the limit N — > 00, the Markovian multi-agent pro- 
cess (4) can be studied a la De Dominicis [6] by introducing 
the generating functional of the dynamics, i.e. 

\ I paths 

= /p(y(o)) e iS « &(t)[!/i( * +1) ~ !/i( * ) ~ ei( * )1+!/i( * )l/ ' i( * ) x 
x e iE„ »«(*)[**+£, Jwm JJ[di /i (t)^ i (t)/(27r)] (5) 

it 

and carrying out the average over the quenched disorder 
(i.e. over the a, g 's with probability distribution (1)). This 
procedure ultimately results in a non-Markovian stochas- 
tic equation for a single effective agent whose properties 
are equivalent to those of the original A^-agent system. In 
our case, the effective agent is trading with frequency /. 
As the calculation is standard, we will limit ourselves to 
the final outcome, referring the reader to e.g. [11] for de- 
tails of a closely related case. The effective- agent equation 
turns out to be given by 

y(t + 1) = y(t) + ]T R(t, t')s(t') + 9(t) + y/Zf z(t) (6) 
t'<t 
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where R(t, t') describes the retarded self-interaction, whose 
precise form will be given below, 6(t) is an external field 
added to probe the system against small perturbations, 
and z(t) is a zero- average Gaussian noise with correlations 
(z(t)z(t')} = A(t,t'), with 



A= [(l + G)- 1 (F + C)(l + G T ) 



(7) 



We used the following notation: I stands for the identity 
matrix, with elements I(t,t') = S t t>', F denotes the matrix 
with elements F(t,t') — /, with / the average trading 
frequency; C and G are defined by their respective compo- 
nents 

C(t,f) = <*(*)*(*')>. and G (t,f) = (J^) (8) 

where the (■■■)„ average is intended over the effective pro- 
cess (6) further averaged over /: 



<■■■>. 



dffq(f)x 



n 



dy(t)dy(t) 
2tt 



g 2 /-/ tf 



£ M , y(t)Mt,t')y(t')~ 



x(-..)e 



iE t S(*)fo(t+l)-!/(*)-0(t)-E t ' R(t,t')s(t')] 



(9) 



q(f) is the common probability density of the trading fre- 
quencies. As usual, C(t,t') and G(t,t') can be identified 
with the disorder-averaged correlation and response func- 
tions of the multi-agent process: 



C (*'*') = ^Eh(^(* , )) P ath s ' 



dis 



d6i(t') 



(10) 
(11) 



dis 



(12) 
(13) 



For simplicity of notation, we set 

C(t,t') = J fq(f)C(t,t'\f)df 

G(t,t')= J fq(f)G(t,t'\f)df 

Finally, the retarded self-interaction kernel has the form 

R^-a/O + G)- 1 (14) 

For q(f) = 5(f — 1) the equations for the standard 'batch' 
MG [11] are immediately recovered. 

Making for the asymptotic behavior of C and G the cus- 
tomary assumptions [16] of time-translation invariance, 



lim C(t + T,t) = C(r) 

t— >OQ 

lim G(t + r,t) = G(t) 



finite integrated response, 

lim V Git, t') < oo 



(15) 
(16) 

(17) 



and weak long-term memory 



lim t') = Vt' finite 

t — ► oo 



(18) 



ergodic stationary states of the dynamics can be fully char- 
acterized in terms of a few parameters. These are, in par- 
ticular, the persistent autocorrelation 



c= lim iVC(i) = f fq(f)c(f)df 



(19) 



with c(/) = lim r _ >(X) (l/r) ^2 t<T G(t\f) the autocorrela- 
tion of the effective trader (trading with frequency /), 
and the susceptibility (or integrated response) 

X = lim E G W= f MfMfW (20) 

t<T 

where %(/) = lim,--^ J2t<r G(t\f) is the effective trader's 
susceptibility. 

Following [11], one can easily derive from (6) an equa- 
tion for the long-time behavior of the rescaled variable 

y(t) = y(t)/t-. 



= --fs + z, 



1 



(21) 



where y = lim^oo y(t), while s and z denote time-averages 
of s(t) and z(t), respectively. As in the standard MG, part 
of the agents will 'freeze' at s = ±1 and will end up using 
just one of their strategies (for these, \y(t)\ oo asymp- 
totically in such a way that y remains finite), whereas the 
remaining agents will keep flipping between their strate- 
gies (for these, y(t) remains asymptotically finite and y is 
zero). It is easy to see that agents are 'frozen' for \z\ > 7 
while they are 'fickle' (with s = z/7) for \z\ < 7. The 
asymptotic values of the interesting quantities can hence 
be immediately derived by separating the contributions 
of frozen and fickle agents. For example, denoting by ( ) 
an average over the Gaussian r.v. z with zero mean and 
variance (from (7)) (z 2 ) = (/ + c)/(l + x) 2 an d defining 



0(/) = <0(|*|-7)> = l- erf 



(22) 



i.e. the fraction of agents trading with frequency / that are 
frozen (strictly speaking, the probability that the effective 
agent is frozen), one has 



if) - <* 2 ) 



*|-7)> + ((*/7) 2 0(7-M)> 



= Hf) + 



Kf) 2 Kf) V tt 

witM(/) = l-0(/) = <0( 7 -|*|)) and 



(23) 



A(/) 



7 



t'<t 



7+~c 



(24) 
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Analogously, x(f) can be calculated from the formula 

7V^/X(/) = W7-W))=^(/) (25) 

which is derived by noticing that the external field and 
the noise term enter (6) in the same way, apart from the 
\faf factor. The calculation of ds/dz is then trivial and 
leads to (25), which in turn gives 



Xif) 



l + * crf A(/) 



V2 



(26) 



Inserting (23) and (26) into (19) and (20) one gets 
two equations for c and x which must be solved self- 
consistently. Solutions will clearly depend on a and on 
the underlying frequency distribution q(f). It is the pur- 
pose of the next two sections to study these equations and 
the corresponding solutions in two interesting cases. 



3 Bimodal frequency distribution 

As a start, let us consider the simplest case, where 

<?(/) = (1 - q)6(f - 1) + qS(f - f ) (27) 

with < q < 1 and < fo < 1, describing a mixed 
population formed by frequent traders, who buy or sell at 
every round, and occasional traders, who instead have a 
finite probability 1— fo of taking no action. The population 
fractions of the two types are (1 — q) and q, respectively. 
The total fraction of frozen agents is evidently 



=(1- g)^(l) + #(./o) = 
A(l) 



erf 



V2 



erf 



A(l) 



erf 



A(/o) 



(28) 



where A(/) is given by (24) with / 
(19) is given by 



C = (1 - q)c(l) + qfoc(f ) 



V2 V% 

1 - q(l - f ). Eq 



(29) 



with c(f) given by (23), while, finally, the susceptibility 
turns out to be expressed as 



X = (l-?)x(l)+?/ox(/o) 



(30) 



with x(f) given by (26). 

Equations (29) and (30) must be solved self-consistently 
and are valid as long as none of the assumptions (15-18) 
is violated. In the MG, ergodicity breaking is related to 
the onset of anomalous response, i.e. to a divergence of x- 
Working out (30) explicitly, one finds the relation 



ax 
1 + X 



= (!-?) erf 



A(l) 
V2 



q erf 



A(/o) 
V2 



(31) 



X is evidently finite for large a. The critical values of the 
control parameter where x diverges are immediately ob- 
tained as 

a c (q, fo) = (l-q) evi(x) + q erf(z^) (32) 
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Fig. 1. Phase diagram of the multi-frequency MG with bi- 
modal frequency distribution. The curves a c vs fo (trading 
frequency of occasional traders) are shown for different values 
of the fraction q of occasional traders. For a > a c the system 
is ergodic. The horizontal dashed line at a ~ 0.3374 marks the 
position of the critical point of the standard MG and coincides 
with the critical line in the case q = 0. 



where x solves (29) at a c . After some algebra, the latter 
condition is shown to be, in terms of x, equivalent to the 
following: 



2/ - (1 - q) erf(x) - qfo evi{x^/f )+ 



e -/o* a 
7T 



(33) 



For a > a c , the system is ergodic and admits a unique 
stationary state. At a c , the integrated response diverges 
and anomalous response sets in. For a < a c , finally, er- 
godicity is broken and the long-time limit of macroscopic 
quantities depends on the initial conditions of the dynam- 
ics. Clearly, the limiting cases q = and fo = 1 reproduce, 
as they should, the results of the standard MG, while the 
limit q = 1 brings us back to the standard case after a 
trivial re-scaling. Notice that, as usual in MGs, x diverges 
when a equals the fraction 1 — <f> of fickle traders. 

Numerical solution of (33) and subsequent insertion of 
the result into (32) leads to the (fo , a) phase diagrams 
shown in Fig. 1 for different values of q. For sufficiently 
small fo, the critical point a c decreases as q increases, 
signaling that the ergodic phase gets larger while the ef- 
ficient phase shrinks. As fo grows, a c tends instead to 
the standard MG value 0.3374 . . . for any q, as was easily 
predictable. 

Solving instead (28), (29), and (30) one obtains the 
behavior of the macroscopic order parameters and of the 
fraction of frozen agents as a function of a upon varying 
q and fo- These quantities give an idea of the interplay 
between the two types of agents. For sakes of conciseness 
and because we believe it is the most interesting instance, 
we concentrate on the case where the trading frequency 
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Fig. 2. Fraction <f>(f) of agents with trading frequency / that 
are frozen for values of q as reported in the figures. The vertical 
dashed lines give the positions of the critical points, below 
which (17) is violated and the theory is no longer valid. The 
continuous (resp. dashed) line represents 0(1) (resp. 0(0.05)). 



of occasional traders is much lower than that of regulars, 
that is we assume regulars and occasional traders operate 
on widely different time scales. We take in particular /o = 
0.05. Results for </>(/) are shown in Fig. 2. It is clear that 
occasional traders are more likely to freeze than regular 
traders, as also argued in the static solution outlined in 
[3]. This tendency is robust with respect to changes of q 
and /o- It is interesting to notice that a larger fraction of 
occasional traders freeze when regulars outnumber them. 

In order to test our predictions, in particular for what 
concerns the phase transition, and to understand further 
the combined effects of regulars and irregulars on global 
efficiency in this simple case, we can study the volatility 
matrix, defined as [11] 



G)- 1 (F + C)(I + G 



T\-li 



(34) 



- has two important properties. First, the magnitude of 
global fluctuations, i.e. a 2 = (A 2 V. , which serves as a 

° \ / time' 

measure of global efficiency (the smaller is a 2 , the more 
efficient is the allocation of resources, that is, the smaller 
the amount of resource that is wasted), is given by 



Fig. 3. Behavior of H and a 2 vs a at fixed /o = 0.05 for 
q = 0, 0.2, 0.4, 0.6, 0.8 (continuous curves, top to bottom). The 
vertical dashed lines mark the positions of the critical points. 
In the non-ergodic regime a < a c the theory is no longer valid. 
Markers denote the results of computer simulations of a system 
of 256 agents for the same values of fo and q, averaged over 
100 disorder samples. 



is efficient, in the sense that it does not create information 
an external agent could exploit to have a gain) is roughly 
given by the persistent part of E. Both these properties can 
be proved under general conditions, as done for instance 
in [11]. 

H is easily seen from (34) to be given by 



H = 



f + c 
2(1 + X? 



(37) 



and tends to f/2 for a — > oo. a 2 can instead be approxi- 
mated with 



a 2 = 



f + J fq{f)(j>(f)df 1 
2(1 + X) 2 2 



1 - \ fq(f)<t>(f)df 



(38) 



a 1 = lim E(t,t) 

t — *oo 



(35) 



Second, the quantity H = (1/P) p (^l/ i )time' wnere 

1 



(A\u) 



= lim 

L — >co 



L — L e 



that quantifies the informational efficiency of the system 
(when H = the minority action is not predictable on the 
basis of the state of the world fi(n) alone and the system 



and tends to / for a — > oo. The proof of (38), which re- 
quires an approximate evaluation of (35), is a slight modi- 
fication of the standard case, reported in [11], and we will 
not report it here. For the present model, we have obtained 
a 2 and H from numerical simulations (performed with flat 
or unbiased initial conditions yi(0) = for all i). The com- 
parison with analytical predictions can be seen in Fig. 3. 
As in the standard MG, the ergodicity breaking transition 
A(n)S ll (^ .„ (36) coincides with a transition between an informationally ef- 
ficient phase with H = (a < a c ) and an informationally 
inefficient one with H > (a > a c ). H vanishes at at 
a c , where % — > co, and decreases as q increases. At the 
same time, a 2 decreases in vicinity of the critical point, 
signaling an increased degree of cooperation. 
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4 Power-law frequency distribution 

Let us now turn our attention to the case where 

K 



q(.f) = «/"- 



/ 



K+l 



(39) 



where the frequency distribution is scale-free (i.e., there is 
no single characteristic time scale) . The standard instance 
is recovered in the limit k — > oo, in which q(f) — > S(f — 1). 
This problem was treated statically in [3] , so we will limit 
ourselves to calculating the phase diagram and discussing 
its qualitative properties. The general dynamical solution 
is now slightly more complicated than the bimodal case, 
however it is still possible to write down closed equations 
for the relevant parameters. The behavior of macroscopic 
observables turns however out to be similar to the bimodal 
case. Let us first notice that, since 



1-</> = kJ .T^erf 
and since, from (26), 

i + x 



V2 



df 



(40) 



(41) 



we immediately obtain the expression for the critical line 
signaling ergodicity breaking in the (k, a) space, namely 



a c (K) = k / r- 1 erf(z V 7)d/ (42) 
Jo 

where again x solves the equation for c, namely 

c = kJ f K c(f)df (43) 

at a c . One can see by a short calculation that in terms of 
x and for a — a c , Eq. (42), equation (43) reads 



2k 



1 



lo 



f erf(xV7) + ^= e- fx2 



df = 
(44) 

When k — > oo, the standard picture is recovered. Using a 
more compact notation, (44) be re-cast as 

" [ 2 - erf(aO] + T ^ x 2 m - F(k, x; 1/2) = (45) 

where we introduced the shorthand 
F(k, x; a) = ^(^D^p [^(k + a) - r u (n + a, x 2 )] (46) 

with r(-) and _T"(-, •) the Euler Gamma function and the 
upper incomplete Gamma function, respectively. (45) can 
be solved numerically for x upon varying k. Subsequently, 
the corresponding value of a c can be calculated from (42): 
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Fig. 4. Phase diagram of the multi-frequency MG with power- 
law frequency distribution. The curve a c vs k is shown. For 
a > a c the system is ergodic and informationally inefficient. 
The horizontal dashed line at a ~ 0.3374 marks the position 
of the critical point of the standard MG. 



The resulting phase diagram is shown in Fig. 4. For large 
k, the standard MG picture is recovered, as expected. As 
k decreases, a c also decreases, proving that the fraction of 
active agents is also getting smaller and smaller. a c finally 
tends to zero when n — > 0. While this picture is quali- 
tatively identical to that derived in [3], there are certain 
numerical differences as in our solution a c (n) seems to ap- 
proach the standard limit a c (oo) ~ 0.3374 slightly faster 
than in the replica solution. However a direct comparison 
could be misleading since the order parameters are defined 
differently in the two cases. This point might deserve fur- 
ther investigation. The fraction <p of frozen agents behaves 
similarly to what shown in Fig. 2, in that agents trading 
less frequently are more likely to be frozen, as also dis- 
cussed in [3] . Clearly, the formulas derived in the previous 
section for a 2 and H can be applied, mutatis mutandis, 
also in this case. 



5 CMG with finite learning rates 



We now add one layer of difficulty by introducing a learn- 
ing rate in the individual agents' dynamics of strategy se- 
lection, as first done for the standard model in [17]. Up to 
now, agents have strictly followed their best performing 
strategy at each round, as encoded in the deterministic 
rule 



Si(n) = sign[2/j(n)] 



(48) 



a c {n) = erf(x) F{k, x; 1/2) 



(47) 



Now we account for the possibility that at each time step 
the value of Si(n) is established by a probabilistic rule. 
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The simplest choice for the latter is 1 

Prob{s 4 (n) = ±1} ~ e ±r *^(") 



And again, we skip all formalities of the derivation of this 
process and focus on its long-time properties. The analog 
(49) of (6) in this case is 



which turns (48) into 

Si(n\Q(n), r<) = sign[y 4 (n) + Q{n)/r t ] (50) 

where the Ci's are i.i.d. Gaussian r.v.'s with probability 
density p(Q = [1 — tanh 2 (()]/2 (zero average, unit vari- 
ance). The positive constants i~i can be interpreted as 
'learning rates', upon varying which one interpolates be- 
tween the deterministic rule (48), that is recovered for 
Ji — > oo, and a fully randomized rule, corresponding to 
Ji = 0. The dynamics of the standard model with finite 
learning rates was tackled in [13], where the phrase 'ad- 
ditive decision noise' was introduced to describe the situ- 
ation of (50). Notice that this type of noise affects fickle 
traders only, because for 'frozen' traders |&/i(n)| — > oo, 
hence it is to be expected that the critical points obtained 
for different trading frequencies do not change. A more 
complicated situation is that of 'multiplicative decision 
noise', first introduced in [18], where instead of (50) one 
has 

*i(n|Ci(n), I}) - sign[ 2/i (n)(l + Q (n)/^)] (51) 
which corresponds to 



Prob{s 4 (n) = ±1} - e 



±risign[j/i(n)] 



(52) 



This choice clearly affects also frozen agents. 

The dynamical theory of this case is a straightforward 
modification of that constructed for the deterministic case. 
In general, with s(n\Q(n), the dynamics (4) becomes 



y l {t + 1) = yi{t) - hi 



N 



J ij s j (t.\<: j (t)J- j ) (53) 



Again, one can write down its generating functional, tak- 
ing into account the fact that this time the calculation 
of one-step transition probabilities, the product of which 
is used to compute averages over paths, requires carrying 
out an average over the Q's (we denote this operation by 

He): 



paths 

= J ' p(y(0)) e [ ^'^ i{t)[vi{t+1) ^ v ^ t ' )+hi ^ 0i{t)]+v ' {t)4 '' {t) x 
x e iE«&(t)E 3 ^^(t|0(t),^)] U[di/i(t)dfc(i)/(27r)] 

** it 

(54) 

Again, averaging over the quenched disorder leads to a 
non-Markovian process that now describes a single effec- 
tive agent trading with frequency / and learning at rate r. 

1 We use for the learning rate the customary symbol F. The 
reader is warned that it is different from the r function ap- 
pearing in Sec. 4. 



l(t + 1) = y(t) + J2 R(t, t')s(t'\at'),r) + 9(t) + ^fz(t) 



t'<t 



(55) 

(7) and (14) are still formally valid, but the correlation 
and response functions C(t, t') and G(t, t') are defined re- 
spectively as 



C(t,t') = (s(t\at),r)s(t'\at'),r))^ 



G{t,t') = 



d 



d0{t>) 



(56) 
(57) 



where the average (•••)_, formerly given by (9), involves 
now also averages over z and _T. In particular, if we denote 
by w(r) the probability density of _T, we have 



(...) 



. = J o dffq{f)J"dr w (r) I [] 



dy(t)dy(t) 



2tt 



(..•)e 



x e 



iE f y(t)[y(t+i)-v W-eW-Ef R(t,t')s(t'\c(t'),r)] 



(58) 



As before, C(t, t') and G(t, t') represent the disorder-averaged 
correlation and response functions of the original system. 

Proceeding as done in Sec. 2 by assuming the validity 
of (15-18), one can derive an equation for the quantity 
y = lim^oo y(t)/t, which reads 



y 



-'ym(r) + z 



(59) 



where all quantities have the same meaning as in (21) 
except that now m(r) is the time-average of s(t\((t), T). 
This change presents no additional difficulty. In the next 
section we will analyze the stationary states and the phase 
diagrams resulting from (50) and (51) separately. 



6 Additive and multiplicative decision noises 

In the case of (50), one immediately sees that the condi- 
tions for an agent to be frozen or fickle are identical to 
those given in Sec. 2, hence the picture emerging from 
different trading frequencies (in particular the expressions 
for x, c and (f>) is left unchanged since r can be integrated 
out. In particular, the phase diagrams presented in Sec. 3 
and 4 for bimodal and power-law frequency distribution 
remain valid. Hence, as in the standard batch MG, the 
introduction of additive noise does not affect the macro- 
scopic properties of the multi-frequency model in the er- 
godic regime. (On the other hand, the introduction of r 
does affect the stationary states of the standard model in 
the non-ergodic regime, as first shown by computer simu- 
lations in [17] and by analytical arguments in [19].) 
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Finally, we turn to the case of multiplicative decision 
noise (51). Following [13], it is convenient to switch to 
the inverse learning rate T = l/T and to introduce the 
quantity 

h(T) = J sign(l + T()p(()d( (60) 

which has the obvious properties h(T) E [0, 1] with h(0) = 
1 and h(oo) = 0. Using this, one concludes from (59) 
with m(r) = m(T) = sign[y]h(T) that frozen agents with 
sign[y] = ±1 have m(T) = ±h(T), respectively, and occur 
for \z\ > -fh(T), whereas fickle agents have m(T) = z/j 
and occur for \z\ < jh(T). 

In order to make contact with the quantities calculated 
in the deterministic decision-making case, we compute the 
autocorrelation of the effective trader with trading fre- 
quency /, c(/), which can as usual be obtained by sepa- 
rating the contributions of frozen and fickle agents, this 
time performing an additional average over T. For sim- 
plicity, one can pass from the variable T with probability 
density w(T) to h, Eq. (60), with probability density 



For the sake of brevity, we specialize the above the- 
ory only to the simplest cases where both / and T have 
binomial distributions: 

<?(/) = (i-«W/-i) + g*(/-/o) (67) 

w{T) = (1 - e)S(T) + eS(T - T ) (68) 

with q, fo and e real numbers belonging to the interval 
[0, 1]. This describes a mixed population of frequent and 
occasional (trading frequency fo) traders, with either de- 
terministic decision making or stochastic decision making 
with learning rate Jo = 1/Tq. Notice that trading fre- 
quency and learning rate are treated as independent vari- 
ables (this seems to the author a rather strong and possi- 
bly unrealistic assumption). Under these hypotheses, one 
has 

P (h) = (l-e)5(h-l) + eS(h-h ), h = h(T ) (69) 
Introducing again A(/) = « / and setting 



p(h) = J™ 5 h- J sign(l + T()p(()d( 



w(T)dT (61) c(/, h) = h 2 



1-erf 



A(/> 



so that 



+ 



1 



c(f) = J [(e(\z\ - 7 h)h 2 ) + (e(jh - M)(z/ 7 ) 2 )] P (h)dh 

(62) 

where (• • •) denotes an average over the Gaussian r.v. z 
with zero average and variance (z 2 ) = (/ + c)/(l + x) 2 , 
where 



A(/) 2 
one finds, for c, 



erf 



V2 

Kf)h 
V2 A(/) V 7T 



+ 

h 



-\{f) 2 h 2 /2 



(70) 



fq(f)c(f)df 



(63) 



is the global persistent autocorrelation. The fraction of 
frozen agents reads 

4> = j dfq(f) j dhp(h) - 7 fc)) (64) 

where it should be kept in mind that 7 depends on / 

(7 = v^7/(i + x))- 

An expression for the total susceptibility \ can be 
found starting from (25) and averaging properly over / 
and h: 



ax 
1 + X 



1 



j dfq(f) j dhp(h) (6(\Z\ -7/!)) = !- 



(65) 

so that as usual \ diverges when the fraction of fickle 
agents equals a. We have thus come to the following ex- 
pression for the critical line: 



a c [q(f),w(r)} 



dfq(f) J dhp(h) erf (xh^J) (66) 



where we have emphasized that a c now depends on the 
trading frequency distribution q(f) and on the distribu- 
tion of learning rates w(r) (via w(T) and p(h)). Again, 
the value of x must be obtained by solving (63) at a c . 



C = (1 - q)(l - e) c(l, 1) + (1 - q)e c(l, h ) + 

+ foq(l - e) c(/ , 1) + foqe c(f , ho) (71) 

and for a c , which now depends on the population fractions 
q and e and on / and ho, 

a c = (1 - q)(l - e) erf(x) + (1 - q)e evf(xh )+ 

+ q(l-e) erf(x yffo) + qe evi{xh yffo) (72) 

When a = a c , (71) takes a simple but rather lengthy ex- 
pression which we do not report. Ultimately, all equations 
can be solved numerically. As already done in Sec. 3, we 
show the phase diagram for the case where regular and oc- 
casional traders operate on very different time scales and 
again choose fo = 0.05 (see Fig. 5). One can clearly see 
that the phase structure of the MG with multiplicative 
noise [13] is preserved, the main effect being a shift of the 
position of the critical lines in the (a, Jo) space to lower 
values of a (signaling again a reduction of the informa- 
tionally efficient phase). 



7 Summary and discussion 

In summary, the MG with agents trading at different fre- 
quencies and learning at different rates has been ana- 
lyzed using gcncrating-functional techniques. The dynam- 
ical critical behavior of the model is not altered by the 
introduction of multiple time scales for trading, but the 
specific phase diagram turns out to depend on the details 
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a a 



Fig. 5. Phase diagram of the multi-frequency MG with 
stochastic decision-making, with bimodal distributions for fre- 
quencies and learning rates, with parameter fo — 0.05. Results 
are shown for q — 0.25 (left panel) and q — 0.75 (right panel). 
In each panel, continuous curves denote the critical lines for 
multiplicative decision noise for e — 0.2, 0.4, 0.6, 0.8, 1 (right to 
left). The dashed vertical line marks the position of the critical 
point for the CMG with deterministic decision-making, which 
coincides with the critical line for the corresponding CMG with 
additive decision noise. 



of the underlying frequency and learning rate distribu- 
tion. We have derived the phase structure for a system 
of agents with deterministic decision-making when the 
frequency distribution is a simple bimodal and when it 
has a power-law form. For the case of stochastic decision- 
making, i.e. finite learning rates, the phase diagram has 
been computed explicitly for a double bimodal distribu- 
tion of frequencies and learning rates, describing fast and 
slow traders with a heterogeneous distribution of stochas- 
ticity in their learning process. In accordance with [3] , we 
found that occasional traders display a stronger tendency 
to freeze, namely to use just one of their strategies, than 
regular traders, thus contributing and additional signal 
to unfrozen players. To the latter, thus mostly to reg- 
ular traders, is ultimately ascribable the building up of 
fluctuations (via freezing) and thus the decrease of global 
efficiency. 

On a purely technical level, it is clear that combin- 
ing static and dynamical methods of statistical mechanics 
of disordered systems leads to a very satisfactory under- 
standing of the the macroscopic properties of MG-based 
market models. On the other hand, MGs are simple enough 
to hope they can serve as the elementary building blocks of 
more complicated and realistic agent-based market mod- 
els, with different types of agents that are heterogeneous 
in strategies, beliefs, learning rates, market-impact evalua- 
tion, etc., and where the plethora of macroscopic phenom- 
ena observed in real systems could be analyzed and traced 
back to the individual laws of motion. Present works con- 
cerning the macroscopic properties of different versions of 



the MG should be seen as the necessary preliminary work 
to develop a full understanding at later stages. 

Indeed, several directions are open for further research. 
For example, it would be interesting to relate r and / by 
letting agents with a smaller learning rate trade more (or 
less) frequently. This could also be done by assuming that 
r and / are not independent qualities, as done in Sec. 6 
of the present work, but come with a joint probability 
density. Another worthy extension could involve the in- 
troduction of (using the terminology of [20]) 'producers', 
i.e. agents who have just one strategy at their disposal. It 
is likely that they could again affect the phase structure. 
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